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Abstract. We implement Groenendijk and Stokhof's partition seman- 
tics of questions in a simple question answering algorithm. The algorithm 
is sound, complete, and based on tableau theorem proving. The algo- 
rithm relies on a syntactic characterization of answerhood: Any answer 
to a question is equivalent to some formula built up only from instances 
of the question. We prove this characterization by translating the logic 
of interrogation to classical predicate logic and applying Craig's interpo- 
lation theorem. 



1 The Partition Theory of Questions 

An elegant account of the semantics of natural language questions from a logical 
and mathematical perspective is the one provided by Groenendijk and Stokhof 
[|j . According to them, a question denotes a partition of a logical space of possi- 
bilities. In this section, we give a brief summary of this influential theory, using 



a notation slightly different from Groenendijk 's presentation 



A question is essentially a first order formula, possibly with free variables. 
We will denote a question by 7(f), where is a first order formula. (We will also 
denote a set of questions by 7<P, where is a set of first order formulas.) An 
answer is also a first order formula, but one that stands in a certain answerhood 
relation with respect to the question, to be spelled out in Sect.^ For example, 
the statement "Everyone is going to the party" {\fxPx) will turn out to be an 
answer to the question "Who is going to the party?" (IPx). 

We assume that equality is in the language, so one can ask questions such 
as "Who is John?" {?x We also assume that, for every function symbol — 
including constants — it is indicated whether it is interpreted rigidly or not. In- 
tuitively, for a function symbol to be rigid means that its denotation is known. 
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For example, under the notion of answerhood that we will introduce in Sect.^ 
it is only appropriate to answer "Who is going to the party?" [IPx) with "John 
is going to the party" {Pj) if it is known who "John" is — in other words, if j 
is rigid. Also, for "Who is John?" (?x w j) to be a non-trivial question, "John" 
must have a non-rigid interpretation. 

Questions are interpreted relative to first order modal structures with con- 
stant domain. That is, a model is of the form (W,!?, /), where is a set of 
worlds, _D is a domain of entities, and / is an interpretation function assigning 
extensions to the predicates and function symbols, relative to each world. Fur- 
thermore, we only consider models that give rigid function symbols the same 
extension in every world. Relative to such a model M = (VF, D,I), a question ?4> 
expresses a partition of W, in other words an equivalence relation over W: 

[74']m = {{w,v) eW^ \ Wg: M,w,g\:^ M,v,g^ (b} . (1) 

Roughly speaking, two worlds are equivalent if one cannot tell them apart by 
asking the question ?(/>. In general, any set of questions ?<? also expresses a par- 
tition of W, namely the intersection of the partitions expressed by its elements: 

= eW^\W^e^: Vg: M,w,g \^(l)^M,v,g h 0} • 

Entailment between questions is defined as a refinement relation among parti- 
tions (i.e., equivalence relations): An equivalence relation A is a subset of another 
equivalence relation B if every equivalence class of A is contained in a class of B. 

?<P h ?^ iff VAf: [?<?]m C [7^]m . (3) 

A more fine-grained notion of entailment is as follows Q . Let x be a first order 
formula with no free variables, and let M |= x mean that Af, w \= x for 

?<P hx iff VM: Af h X ^ [?<^]m C [7^j]m • (4) 

Pronunciation: The questions ?<? entail the question 7ip in the context of x (or, 
given x) ■ The context x is intended to capture assertions in the common ground: 
If it is commonly known that everyone who got invited to the party is going, 
and vice versa {Wx{Ix ^ Px)), then the questions "Who got invited?" {7Ix) 
and "Who is going?" {IPx) entail each other. 



1.1 Translation to First Order Logic 

Groenendijk and Stokhof |^ do not provide an inference system for the above en- 
tailment relation, but we can use the following entailment-preserving translation 
procedure to ordinary first order logic. 

The intuition behind the translation is simple: One question entails another 
iff the former distinguishes between more worlds than the latter does. In other 
words, one question entails another iff every pair of worlds considered equivalent 
by the first question is also considered equivalent by the second. For instance, 
the question "Who is going to the party?" (IPx) entails the question "Is John 
going to the party?" {7Pj). After all, if exactly the same people are going to 



two parties {\/x{Px <-> P'x)), then either John is going to both parties, or he is 
going to neither of them {Pj ^ P' ])■ 

For any first order formula (p, let (jf be the result of priming all occurrences 
of non-rigid non-logical symbols. Formally, define 

(Pti...t„r y^{f{tl,-..X) if/isrigid 

{s^ty = {s* Kt*) ■'^i'---'"^ otherwise 
{(j,AtP)* = (/,* Atjj* x*=x (5) 

(-0)* = -(0*) T* = T 

(3x0)* = 3xi(j)*) ±* = ± . 

Furthermore, for any question Icj), let be the first order formula Vx(0 <-» (/>*), 
where x are the free variables of 0. For any set of questions ?^, let be the set 
of first order formulas { ?(/)^ | (j) € (!>}. Now. we can reduce entailment between 

questions to ordinary first order entailment, as follows. 

Theorem 1. The following entailments are equivalent. 

1. W |=x ?V' 

2. ?^#,x,X* H^V-* 

Proof. [=>] By contraposition. Suppose ?<?'^,x,X* ^ TV"*- Then there is a first 
order model M = (D, I) that verifies the formulas x, X* but not ?^#. 

Now consider the first order modal structure N = {{w, v}, D, I'), where for all 
non-logical symbols a, we let /^^(a) = and Iy{a) = I{a'). By construction, 
{w,v) e [7(P]n and {w,v) [?'i/']Af- Furthermore, N \= x- Therefore, ^^j- 7tp. 

[<=] Again by contraposition. Suppose Then there is a first order 

modal structure M = {W,D,I) with w,v € W such that {w,v) G [???]m yet 
{w, v) ^ [^iP]m- Furthermore, MjW \= x and M,v \= x- 

Now consider the first order model N = {D,I'), where for all non-logical 
symbols a, we let /'(a) = /u,(a) and I'{a') = /„(«). By construction, TV verifies 
the formulas 7<P*,x, X* but not l^p*. Therefore, x, X* ^ □ 

By this theorem, we can determine whether an entailment between questions 
holds, by translating the formulas involved to ordinary predicate logic. For ex- 
ample, suppose that a is a rigid constant, and consider the supposed entailment 

?Px \= IPa . (6) 

In order to determine whether the entailment is valid, we need only translate 
the two formulas to ordinary first order logic, giving 

V.x(P.x ^ P'x) \^Pa^ P'a . (7) 

As can be easily seen, this is indeed valid. On the other hand, if a were non-rigid 
then the entailment would not be valid. As a second example, consider 

7Px \=yx{Px^Qx) "^Qa . (8) 

Given that a is rigid, this is again valid, as we can see from the translation 

yx{Px ^ P'x),yx{Px ^ Qx),yx{P'x ^ Q'x) \=Qa^ Q'a . (9) 



1.2 Varying Domains 



The partition theory as formulated in the previous section makes some natural 
predictions regarding question entailment. For example, the question "Who is 
going to the party?" [IPx) entails the question "Is everyone going to the party?" 
(lyxPx). Intuitively, whenever one knows (completely) who is going to the party, 
one also knows whether everyone is going to the party. As a matter of fact, much 
of the motivation for the partition theory of questions comes from its natural 
account of entailment relations between sentences embedding such questions.^ 

Unfortunately, the theory also makes some counterintuitive predictions. For 
example, the question 7j w6 ("Is John the same person as Bill?", where j and b 
are interpreted rigidly) is entailed by every question, including the trivial ques- 
tion ?T. Likewise, 73x3y^{x y) ("Does there exist more than one entity?") is 
entailed by every question]^ For this reason, an alternative semantics has been 
discussed, in which the questions are interpreted with respect to first order modal 
structures with varying domains |9|. That is, each world w is associated with 
its own domain of entities D^. This semantics is more general, since it allows 
more models: Every constant domain model is also a varying domain model, so 
fewer entailments between questions are valid for varying domains. In particu- 
lar, IBxBy-i^x w y) is no longer entailed by every question. Also, IPx no longer 
entails l^Px, though the entailment IPx, Ix k, x ^ l^Px remains valid. 

It is not entirely trivial to generalize the partition semantics of questions 
to varying domains, since it is unclear in (^ which assignments g should be 
quantified over. One generalization, which Groenendijk and Stokhof 1^ seem to 
suggest but do not state, is to define 

[1cI,]m ^ {{w,v) eW^ \\lg: 

(g eDtk M,w,g h 0) ^ (g £ Dt & M,v,g h 0) } , 

where denotes the set of assignment functions that map all free variables 
in (j) to entities in Dw Question entailment, as defined in (^, remains the same. 

Various generalizations of the partition theory to varying domains, including 
that in (^0|), can be straightforwardly reduced to the constant domain version by 
introducing an existence predicate E. In the case of ([To|), one would relativize all 
quantifiers — that is, replace 3x4> by 3x{Ex A (p) and Va;0 by \/x{Ex (j)) — and 
consider questions of the form ?(Exi A ... A Exn A 0), where the 
free variables of 0. Theorem |] can then be applied after the reduction. 

In the rest of this paper, we will restrict ourselves to constant domain models. 

2 A Syntactic Characterization of Answerhood 

Groenendijk and Stokhof's entailment relation between questions, as discussed 
in the previous section, allows us to define a notion of answerhood. 

^ Cf. Nelken and Francez for one of the few competing theories in this respect. 
* In Sect. ^, we will see exactly which such counterintuitive predictions are made by 

the theory. For now, the reader can use Theorem ^ to check that the theory indeed 

makes these predictions. 



Definition 1 (Answerhood). Let 7(f) be a question and a first order formula 
without free variables. We say that ip is an answer to 7(j) if7(t> |= 7ip. 

According to this notion (also termed licensing by Groenendijk [Q), "Everyone is 
going to the party" (VxPx) is an answer to "Who is going to the party?" {IPx) , 
because IPx |= IMxPx. (We wiU generahze to sets of questions in Sect. |2.2| .) 

Note that, under this definition, any contradiction or tautology counts as an 
answer to any question. Groenendijk and Stokhof |Q, j^] define a stricter notion of 
pertinence^ which excludes these two trivial cases by formalizing Grice's Maxims 
of Quality and Quantity, respectively. In this paper, however, we will stick to the 
simpler criterion of answerhood as defined above, which corresponds to Grice's 
Maxim of Relation. 

We now have a semantic notion of answerhood, telling us what counts as 
an answer to a question. However, for practical purposes, it is useful to have 
also a syntactic characterization of this notion. Can one give a simple syntactic 
property that is a necessary and sufficient condition for answerhood? As we will 
see in a minute, one can. 

First, let us look at a partial result discussed by Groenendijk and Stokhof 
and Kager Define rigidity of terms and formula instances as follows. 

Definition 2 (Rigidity). A term is rigid if it is composed of variables and 
rigid function symbols. A formula (j) is a rigid instance of another formula ij) if 
(j) can be obtained from -0 by uniformly substituting rigid terms for variables. An 
identity statement s K,t is rigid if the terms s and t are rigid. 

For example, if c is a rigid constant and / is a rigid function symbol, then rigid 
instances of Pa; include Pc, Px, Pf{c), and Pf{x). The identity statement cwx 
is also rigid. Notice that rigid instances are not necessarily rigid: If c is rigid, 
then Red is a rigid instance of Rxd even if the constant d is not rigid. 

Groenendijk and Stokhof 1^ and Kager observed that rigid instances of 
a question constitute answers to that question. By a simple inductive argument, 
one can generalize this a bit. 

Definition 3 (Development). A formula -ij) is a development of another for- 
mula (j) ( written "(j) < " j if ip is built up from rigid instances of (p 0,'nd rigid 
identity statements using boolean connectives and quantifiers, or if tp isT or _L. 

For example, if c and d are rigid constants, then Pc A Pd and 3x{Px A ^{x ~ c)) 
are both developments of Px. 

Theorem 2. If(p<i> then Icp |= lip. 

Proof. By induction on the size of (p. □ 

Using the translation procedure in Sect.0, together with Craig's interpolation 
theorem for first order logic, we can prove the converse as well. 

Theorem 3. Let y be the free variables of some formula ip. If 7(p lip, then 
there exists some formula with no free variables beside y such that (p <"& and 
X 1= Vy(V' ^ i^)- 



Proof. First, we will prove the special case where (f) is an atomic formula, say Px. 
Suppose that 7Px lip{y). Then, by Theorem |l|, 

Vf(Pf ^ F'f),x,x* h Vy(V'(y) ^ rm ■ (11) 

As a fact of first order logic, we can replace the universally quantified variables 
in the consequent by some freshly chosen constants c. This results in 

yx{Px^P'x),x,X*^i^{c)^r{c) . (12) 

and, from this, 

Vf (Ff ^ P'f), x\r{c)hx^ ^(c) ■ (13) 
By Craig's interpolation theorem for first order logic, we can construct an inter- 
polant ?9(c) such that 

Vf(Pf ^ P'x), x*,-^*{c) h ^{c) , (14) 

m\-x^ m , (15) 

and the only non- logical symbols in i?(c) are those occurring on both sides of (p^). 
From the way the translation procedure (•)* is set up, it follows that the only 
non-logical symbols that x* ^-nd ip* on the one hand, and x and on the other 
hand, have in common, are rigid function symbols. Thus, i9(c) contains no non- 
logical symbols beside P, c, and rigid function symbols. 

Removing primes uniformly from all predicate and function symbols in ([T^), 
we get X \= V'(c) ^(c)- From (|l5|), we get the converse: x H ^(c) — ^ V'(c)- 
Together, this gives us 

X h ^(co ^ m ■ (16) 

Since the constants c do not occur in Xi we can replace them by universally 
quantified variables. This results in 

X h Vy(V(y) ^ m) • (17) 

Furthermore, '&{y) contains no non- logical symbols beside P and rigid function 
symbols. From this, it follows that '&{y) is a development of Px. 

As for the general case, suppose T(j){x) lij}. Choose a fresh predicate 
symbol P with the same arity as the number of free variables of <j). Then it 
follows that IPx \=xf\y3(P3-^4>{S)) '^V'- Apply the above strategy to obtain a 
development -d of IPx such that x ^ '^x{Px ^ (pix)) |= Vj/('!9 ^ Let be 
the result of replacing all subformulas in of the form Pz by (f>{z). Then is a 
development of (f>, and x H '^?/(V' ^ t?')- 1^ 

Thus, the syntactic notion of development corresponds precisely to the semantic 
notion of entailment between questions. 

Recall that the formula x in Theorem ^represents an assertion in the common 
ground. If no assumptions on the common ground are made (i.e., x — T), then 
Theorem ^ reduces to the following syntactic characterization of answerhood. 

Corollary 1. ip is an answer to ?</> iff ip is equivalent to a development of <j). 

This syntactic characterization is useful for several purposes. First of all, it 
makes possible a thorough investigation of the predictions made by Groenendijk 



and Stokhof 's theory of answerhood: It shows what their semantic theory reaUy 
amounts to, syntacticaUy speaking. 

Second, this result opens the way to practical question answering algorithms. 
Now that we have a syntactic characterization of answerhood, we can address the 
question answering problem purely in terms of symbolic manipulation without 
having to refer to the semantics. This will be the topic of Sect.^. 

Another advantage is that it becomes possible to compare different theories of 
questions and answers. For instance, while at first sight Prol og a nd the partition 
theory of questions seem incomparable, we will see in Sect. 3^ that they are in 
fact closely related. 



2.1 Languages without Equality 

What if we do not have equality in the language? This is an interesting question 



for at least two reasons. First, as we saw in Sect. 1.2, a number of counterintuitive 
predictions of the partition semantics involve equality. For example, sentences 
such as -^{j « b) ("John is not Bill", where j and b are interpreted rigidly) and 
3x3y^{xKiy) ("There exist at least two entities") are answers to every question. 
Removing equality would prevent these predictions, though then answers like 
"Only John is going to the party" would no longer be expressible. 

The second reason why eliminating equality from the language is interesting is 
more practical. In question answering algorithms, it is convenient not to have to 
deal with equality, since equality reasoning is very expensive. From the theorem 
proving literature, one can conclude that dealing with equality is not feasible for 
many practical applications. 

As it turns out, the syntactic characterization result is even simpler for first 
order languages without equality. The corresponding notion of development of (j) 
is simply a formula built from rigid instances of (j) using boolean connectives and 
quantifiers, or T or _L. The same proof of Theorem^ goes through, since Craig's 
interpolation theorem holds regardless of whether equality is present. 



2.2 Multiple Questions 

The notion of answerhood that we introduced earlier (Definition generalizes 
trivially to multiple questions: If '0 is a first order formula without free variables, 
we say that -0 is an answer to ?^ if I'P \= lip. The notion of development 
(Definition H) also generalizes: A formula ip is a. development oi^P iiip is built up 
from rigid instances of elements of P and rigid identity statements using boolean 
connectives and quantifiers, or if is T or _L. The proof of Theorem ^ then 
generalizes directly to multiple questions, as does Corollary |l|: ip is an answer 
to ?^ iff i/j is equivalent to a development of ^. 

Multiple questions arise naturally in two settings. First, a query like "Who 
got invited to the party, and who is going to the party?" corresponds to a set 
of questions like ?<? = {7Ix,7Px}. According to the above generalization of 
answerhood, one answer to this query is Vx(/a; Px): "Everyone invited is 
going, and vice versa." 



Second, suppose that, as part of their common ground, the questioner and the 
answerer both know the complete true answers to a certain set of questions 70. 
For example, if it is commonly known who got invited, then 70 contains 7Ix. 
Extend the notion of entailment in (^) to handle questions as contexts in the 
following way: If x is a first order formula with no free variables, then 

?(? ,0 7ij iff VM = {W, D, I) : (18) 

M h X, UO]m ^W^^ mM Q Mm . 

Intuitively, given the common ground 70, a formula -0 is an answer to a ques- 
tion 7(j) just in case the entailment 7(j) \^70 7ip holds. Indeed, the entailment 

7Px \=7i^ 7\fx{Ix ^ Px) (19) 

is valid: If it is commonly known who got invited, then in response to "Who is 
going to the party?" one can answer "Everyone invited is going, and vice versa." 
In fact, we have 

7^ \=^;>e ?V' iff 76', hx ^V- ■ (20) 

In particular, 

?</) h?e iff ?6),?0h?V'- (21) 

This result means that asking a question 7(j} under some common ground 70 is 
exactly like asking the set of questions 70 U {?(/)}U under no common ground, 
in terms of what counts as an answer. So multiple questions arise again 0. 

We are about to present a question-answering algorithm. For simplicity, we 
will assume that there is only a single question to answer, and that the context 
is empty. The algorithm we will present generalizes trivially to handle multiple 
questions at once, and hence to handle questions in the context.^ 



3 Finding the Answer to a Question 

The general task of question answering is the following. 

Given a finite first order theory U and a question 7(f>, find an answer 
to 7(j) that is entailed by E and that is as informative as possible. 

In other words, we wish to find an informative formula t^i, such that S \^ ij) 
and 7(j) \= 7ip. Here the theory S is intended to capture the answerer's private 
knowledge, and we measure how informative an answer is by its logical strength: 
One answer is more informative than another if the former entails the latter. For 
example, an answerer who knows that John is going to the party and that it 
rains {U — {Pj, R}) might reply to "Who is going to the party?" {7(f> = 7Px) 
with the statement "Someone is going to the party" (t/i = 3xPx). If the constant 

^ In passing, we mention that the semantics of multiple questions can be reduced to 
that of single questions, provided that the domain contains at least two elements. 
More specifically, any two questions ?(^i and 702 are equivalent to the single question 
7 (^{xKiy /\(j)i)\J {-<{xP^y) /\cf>2)) ■ This reduction is of practical importance to the reader 
who encounters a fairy that promises to answer a single eternal burning question. 



symbol j is rigid, then the statement "John is going to the party" (Pj) would 
be an answer and preferred because it is more informative. 

The first question that comes to mind when considering the task of question 
answering is: Is there always an optimal answer? In other words, given a finite 
theory S and a question 7(f), is there always (modulo logical equivalence) a unique 
most informative answer to 7(f) entailed by IJ7 The answer is no. 

Theorem 4. There is a finite theory S and a question 7(f) such that, for every 
answer to 7(f> entailed by S, there is a strictly more informative answer to 7(f) 
entailed by U. 

Proof. Let U be the theory 

{\fxyz{x <yAy<z—^x<z), Vx-i(x < x), \fx3y{x < y)} (22) 

("< is an unbounded strict order"), and ipn {n G IN) be the formula 

3x1 .. . 3a;„ A,j<„; -^i^r « Xj) (23) 

("there are at least n different objects"). Every ■0„ is entailed by S. Furthermore, 
every if>n is an answer to the question ?T. (This follows either via Theorem^, or 
directly from how question entailment was defined in (Q).) The optimal answer 
would have to entail each ipn and, furthermore, contain no non-logical symbols. 
It follows from the compactness theorem that there is no such formula. □ 

Equality is not essential to this counterexample. Indeed, the same argument 
goes through if we replace the equality sign « by a non-logical binary relation /, 
use 7Ixy as the question, and extend the theory E with replacement axioms ^ : 

— S U { \/xIxx, \/xyzu{Ixz A lyu Ax<y^z<u), ^^^^ 
\/xyzu{Ixz A lyu A Ixy — > Izu) } . 

Moreover, in the absence of equality there is a second problem, concerning un- 
decidability. It is undecidable whether the given formula is the most specific 
answer to a question entailed by a theory. This follows from a simple reduction 
argument: Suppose we do not have equality in the language, and suppose the 
theory S does not contain any rigid function symbols. Then S is satisfiable iff T 
is the most specific answer to ?T entailed by S. But as we all know, first order 
satisfiability is undecidable. 

Notwithstanding these negative results, it is possible to construct a sound 
and complete question answering algorithm. The output of the algorithm is a 
sequence of answers that is cofinal in the set of all answers entailed by the 
theory S. Cofinality means that, for any answer that is entailed by S, there is a 
more informative answer in the sequence generated by the algorithm. Formally: 

Definition 4 (Cofinality). A set of formulas ^ is cofinal in another set of 
formulas ^ if each formula in ^ is entailed by some formula in (p. 

We call a question answering algorithm sound if it generates only formulas that 
are answers to the question and that are entailed by the theory. We call the 



algorithm complete if the sequence it generates is always cofinal in the set of all 
answers to the question entailed by the theory. A question answering algorithm 
with these properties is depicted in Fig.n^. 



To show that there are indeed algorithms satisfying these constraints, we 
will now discuss a rather trivial algorithm. Recall that, to answer a question 
given a theory E, it suffices to find developments of (f> that are entailed by S. 
The search for answers can be conducted using any theorem proving technique, 
including tableaux. A first stab at a question answering algorithm, then, is to 
syntactically enumerate and check all potential answers ip. 

Algorithm 1. To answer the question 7<j> given the theory S, repeat in dovetail 
fashion for every development ip oi (p: 

1. Initialize a tableau with a single branch, consisting of S and ^ip. 

2. Keep applying tableau expansion and closure rules. 

3. If the tableau becomes closed, report ip as an answer. □ 

This algorithm is sound and complete, because tableau-based first order theorem 
proving is. However, it is also horribly inefficient, since it considers every devel- 
opment of (/> as a potential answer, without taking the theory E into account. 

The challenge, then, is to construct an question answering algorithm that is 
sound and complete (like Algorithm ^ yet as efficient as possible. In the next 
section, we will make a start by providing a more intelligent algorithm. 

3.1 Tableau-Based Question Answering 

We will now introduce a sound and complete question answering algorithm based 
on free variable tableaux Without loss of generality, we assume that the 
question l(p consists of a single atomic formula Px, called the answer literal ^ . 
(If that is not so, simply add the formula \/x{Px ^ (p>) to the theory, where P 
is a new predicate symbol and x are the free variables of (p.) We also assume 
that the theory E is Skolemized; that is, any existential quantifier in E has been 
eliminated with the help of additional, non-rigid function symbols. Finally, we 
will disregard equality for the moment, but the algorithm can be extended to 
deal with equality as well (we will return to this issue at the end of this section). 

Table | summarizes the tableau calculus for first order logic without equality 
given by Fitting ||]. (We omit the existential rule since the theory is Skolemized.) 




Soundness: \ii : E \= ipi &l l(p \= lipi 
Completeness: \/xp : E \= ip &c l(j> \= Ixp 3i : ipi \= ip 



Fig. 1. A question answering algorithm 



Table 1. Tableau expansion and closure rules for theorem proving 



Conjunction 



A V 



Quantification 



where y is fresh 



~"Plx/y] 



Negation 




Closure 



A branch is closed if it contain a formula and its negation. 

A tableau is closed if all its branches are closed. 

A substitution a is closing if the tableau is closed after applying cr. 



In the algorithm that we will introduce shortly, besides the usual tableau expan- 
sion rules, one other operation is allowed. At any stage of tableau construction, 
one is allowed to create a copy of the question — renaming its free variables to 
new variables — and either add the copy to all branches or add its negation to 
all branches. We term this operation the Add Instance rule, shown in Table 0. 



Note that, whereas the rules in Table |l| are only applied to a single branch 
at a time, the Add Instance rule is always applied to the entire tableau. This 
difference is not essential — completeness is not lost if we restrict the application 
of the Add Instance rule to single branches — but it increases efficiency and it 
renders the generated answers more concise. 

Example 1. A partially developed tableau is given in Fig.^, where we are given 
the theory S = {{Pa A Pc) V {Ph A Pc)} and wish to answer the question IPx. 
Note the use of the Add Instance rule, indicated by "!". Also note that y := c is 
a closing substitution: After applying it, the tableau is closed. 

Every closing substitution generates an answer, in the following way. Suppose 
we face a partially developed tableau T, and it can be closed by a substitution a. 
Let be the (finite) set of all instances of the question that have been added so 
far using the Add Instance rule. Then /\<P'^ is an appropriate answer. In fact, 
we can even do a little better and give a slightly more informative answer, by 
only considering those added instances that participate in closure. 

Definition 5 (Closure). Let T be a tableau, a be a substitution overT, and k 
be a set of formula occurrences in T . The pair (cr, k) is a closure if contains, 
on each branch, some formula and its negation. 



Table 2. The Add Instance rule for question answering 



Add Instance 



Py --Py 

where IPx is the question, and y are fresh variables 



(Pa A Pc) V {Pb A Pc) 



Pa APc PbA Pc 



Pa 
Pc 



Pb 
Pc 



-^Py I -^Py ! 
Fig. 2. A partially developed tableau 



For every closing substitution a, there is at least one closure {a, k). Let consist 
of exactly those formulas in ^ that appear in k. With ANS(cr, k) we will denote 
the result of universally quantifying all free variables in ~^ ■ 

Example 2. Consider again the tableau in Fig. ^. Let a be the substitution y := c, 
and let k contain the occurrences of Pc and -'Py on both branches. Then the 
pair {a, k) is a closure. It generates the answer ANS((t, k) = -^^Pc. 

When generating answers from closures, we will only be interested in the most 
general closures. One closure (cti, ki) is more general than another {a2, K2) if ci 
is more general than a2 and ni is a subset of K2- Closures that are more general 
do as little as possible besides closing the tableau, and so generate answers that 
are more informative. When there is more than one most general closure, we 
compute all the corresponding answers and take their conjunction. 

One remaining difficulty concerns rigid function symbols. According to our 
syntactic characterization of answerhood, non-rigid constants and function sym- 
bols are not allowed to occur in generated answers. For instance, suppose the 
theory is {Pc}, where c is a non-rigid constant. To the question ?Px, we must 
answer 3xPx rather than Pc. Moreover, Skolem functions created during tableau 
expansion are also considered non-rigid and disallowed in answers. For instance, 
given the theory {3xPx} and the question ?Px, we must answer 3xPx, rather 
than "Ps, where s is the Skolem constant such that Ps" . Our algorithm achieves 
this by applying Chadha's unskolemization procedure 

The complete question answering algorithm is as follows. 

Algorithm 2. To answer the question IPx given the theory E, start by initial- 
izing a tableau for S. Next, do one of the following, repeatedly, ad infinitum. 

1. Apply a tableaux expansion rule (Table I). 

2. Apply the Add Instance rule (Table ||). 

3. Take the conjunction ■00 — A ANS(it, k) over all most general closures (cr, k). 
Unskolemize -i/jo to remove any non-rigid or Skolem function symbols, and 
report the result as the next answer. □ 

Theorem 5. Algorithm^ is sound and complete, provided the non- deterministic 
choices are made in a fair manner. 



Proof sketch. Call a formula tp a pre-answer if ijj is built up from instances 
of the answer literal Px — allowing non-rigid and Skolem function symbols — 
using boolean connectives and quantifiers, or if -;/; is T or _L. We first prove that 
the algorithm, minus unskolemization, generates pre-answers in a sound and 
complete manner. Soundness means that the algorithm only generates (yet-to- 
be-unskolemized) conjunctions "00 that are pre-answers and that are entailed by 
the theory. Completeness means that the sequence of conjunctions generated is 
cofinal in the set of all pre-answers entailed by the theory. 

Soundness: Because ANS(cr, k) is always a pre-answer, so is any generated 
conjunction ipo- Moreover, any closure (cr, k) for our question-answering tableau 
gives rise to to a closed theorem-proving tableau for Z'U(^k)°^- Since tableau the- 
orem proving is sound, the theory S must entail ANS((T, k). Hence S entails ipQ. 

Completeness: Suppose that is a pre-answer entailed by U. By completeness 
of tableau theorem proving, we can find a closed theorem-proving tableau for 
S U and systematically transform it into a question-answering tableau 

for (17, 7Px) that generates a pre-answer ipo entailing ip. Given this, it can be 
shown that in fact any fair question-answering tableau expansion procedure will 
eventually generate a pre-answer entailing ip. 

We now need to relate pre-answers to answers. The unskolemization proce- 
dure is sound; that is, unskolemizing ipo always gives a formula entailed by ipQ. 
Besides, unskolemizing any pre-answer gives an answer, so Algorithm^ is sound. 
The unskolemization procedure is also complete; that is, whenever a pre-answer tpo 
entails some answer ip, the result of unskolemizing ipo also entails ip. Besides, 
every answer is a pre-answer, so Algorithm ^ is complete. □ 

One way to guarantee fairness in Algorithm ^ is to implement it with depth 
first iterative deepening, just as Beckcrt and Posegga Q did in their lean 
prover. In fact, we have modified lean to become a lean question answerer. 

The difference between Algorithms |l| and ^ is that the latter waits until 
closing the tableau before deciding which answer to prove to follow from the 
theory. In other words. Algorithm ^ does not commit to the rigid instances that 
constitute the development until they are determined by the closure. This ability 
to postpone commitment is exactly the strength of free variable tableau calculi 
as compared to ground tableaux 

This algorithm can be extended to deal with equality in two steps. First, 
add the tableau rules necessary for theorem proving with equality |^ . Second, 
generalize the Add Instance rule, so that not only instances of the question but 
also (in)equalities can be added to the tableau. 

3.2 Prolog as a Special Case 

A precise connection can be established between Prolog and our algorithm. Pro- 
log performs question answering in a sense more restrictive than considered here, 
because it makes extra assumptions about the theory S and the question 7(j): 

1. The theory S is required to be in Skolemized Horn form and the question 
must consist of a single atom, in order to make computation feasible. 



2. There is no equality predicate in the basic language of Prolog. 

3. Prolog assumes that all function symbols and constants are rigid, making Pc 
a potential answer to 7Px even if the symbol c resulted from Skolemization. 

4. Due to its depth-first search strategy, Prolog is complete only for some the- 
ories, for example theories without cycles among predicate symbols. 

Subject to all these restrictions, Prolog is an optimal question answering algo- 
rithm: Given a theory and a question, it produces answers optimal in the sense 
of the partition theory of questions.^ 

As we can see, Prolog is a question answering algorithm that makes many 
extra assumptions. The present work makes it possible to distinguish and identify 
these assumptions, and eliminate them. A broad spectrum of generalizations of 
Prolog can then be considered. 



4 Conclusion 

This paper makes two contributions. First, we presented a syntactic characteriza- 
tion of answerhood for the partition semantics of questions. The applications of 
this result are mainly internal to the partition semantics: It explains the meaning 
of a question in terms of the form of its answers. 

Second, our tableau-based question answering algorithm connects two im- 
portant research traditions: question answering systems such as Prolog, and the 
formal semantics of natural language questions. We feel that the link between 
these two fields of research has been neglected in the past, and hope to bring 
them together. 

We want to mention three directions of further research (we stress the third). 

Logical Some theoretical issues are still to be addressed: How can answerhood 
be characterized syntactically when the semantics allows varying domains? 
Also, is it decidable whether a given answer is the optimal answer to a 
question entailed by a theory? Without equality, this problem is undecidable, 
as we proved in Sect.|3| We have yet to find a similar reduction argument for 
the case with equality. 

Linguistic We want to bring our theoretical results to bear on the semantics of 
questions in natural language. In particular, our syntactic characterization 
result clarifies the linguistic predictions made by the partition semantics . 

Computational We want to use this work as a unifying framework to compare 
different approaches to question answering. In particular, we want to inves- 
tigate a variety of Prolog generalizations and determine which assumptions 
made by Prolog are feasible to drop. 

Like us. Green and Luckham and Nilsson have applied theorem 
proving to question answering, but their criteria for answerhood are tied 

® One apparent difference between Prolog and our algorithm is that Prolog is based on 
resolution, whereas we used tableaux. However, the two methods are closely related, 
and Prolog can be interpreted as a variant of so-called connection tableaux [[L0[. 



to the syntax of formulas in prcncx normal form. This difference explains 
why their algorithms omit unskolcmization, a step necessary for soundness 
under the notion of answcrhood we adopt here. Also, Bos and Gabsdil Q 
have devised a simplified version of the partition semantics for computational 
purposes. These efforts seem to fit nicely in our picture. 
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